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(57) ABSTRACT 

The invention provides a technique for generating minimal 
debugging type information in a distinguished compile unit 
while still supporting the debugging of classes for which the 
distinguished compile unit is not, in this module, being built 
or debugged, where the debugging information for the 
classes has been previously compiled into one or more 
object modules. On detecting a reference to a type in a 
program, the compiler inserts code into the module it is 
compiling to direct the linker where to locate the debugging 
information object module describing that type. Usually, the 
object module will be located in a separate link library, so 
the compiler directs the linker to add the debug library to the 
list of libraries from which it tries to resolve symbolic 
references, and to add a reference to an external linkage 
symbol defined in the object module describing the type. 
'Hie linker will have lo resolve this symbol by adding the 
debug library into the link. 
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MINIMIZING DEBUG INFORMATION FOR 
GLOBAL TYPES IN COMPILED 
LANGUAGES 

CROSS-REFERENCE TO RELATED 
APPLICAnONS 

The present invcDtion is related to U.S. patent application 
Ser. No. 08/992.876, entiUed "A BINARY UBRARY WITH 
DEBUGGING SUPPORT," filed Dec. 17, 1997, and 
assigned to the assignee of the present application. 

FIELD OF THE INVENTION 

The present invention is directed to an improved tech- 
nique for providing minimal debugging information in an 
efBcienl compilation time. 

BACKGROUND OF THE INVENTION 

Object-oriented programs are programs written in pro- 
gramming languages such as C++ that support a particular 
form of user-defined data types called "classes". A class 
"declaration" in the programming language specifies the 
data contained by variables of the class type, or operations 
supported by those variables, or both. Variables or instances 
of these class types are caUed objects. 

A feature of C++ programming, and similar object ori- 
ented languages, is that its classes are stored in separate files, 
often grouped together into libraries. The source code that 
defines a class is commonly separated into a declaration part, 
contained in a "header file", and an implementation part, 
contained in a "body file". The header files contain impor- 
tant preprocessor directives; the preprocessor directive 
#include permits the header file to be referred to by the body 
file, after which declarations in the header file may be used 
in the body file. A header file will be referenced by its 
corresponding body file, but may also be referenced by one 
or more other body files that make use of the defined class. 

A compiler converts the source language contained in a 
body file, plus all referenced header files, into an object 
module file containing machine code. An executable pro- 
gram is formed by a linker or linking loader which combines 
the object modules generated from several body files, and in 
doing so, resolves references from one object module to 
symbols representing subroutines and data definitions found 
in another. 

While some errors in programs are so fundamental that 
they hall compilation, more often logical errors prevent the 
program from producing the results expected. Therefore, 
modern compiler suites generally include a software tool 
called a debugger that can be used to diagnose the program 
and trace these errors. To support the debugger, the compiler, 
under the control of an option, produces information 
describing the symbols and types in the program as well as 
information to map between source lines and the binary code 
output. This extra information enables the programmer to 
examine the types, variables and data structures by name and 
to follow the execution of the program through the source 
code. Current compilers, such as IBM's VisualAgeJ C++ for 
OS/27 Version 3.0, generate debug information in the fol- 
lowing naive fashion. For every type referenced in the 
source for the compilation unit, a full description of that type 
is provided in the debug type information that is included in 
the resulting object module. If the same type is referenced in 
multiple body files, then a copy of that type description will 
be generated in each of the object modules. This duplication 
results from the fact that the compiler processes the body 
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files one at a time, and therefore does not know whether a 
needed type description will be generated in some other 
object module. Because of the size of the debug information, 
this duplication can result in massive executable module 

5 sizes where the size of the debug information dwarfe all 
other aspects of the module. In addition, significant compile 
resources (time, working set, etc.) are devoted to the creation 
of this debug information so that widespread duplication 
represents a large degradation in the compile time needed to 
build the executable module. 

The prior art contains two approaches to ameliorating the 
module size and compile time problems. 

One approach is to enhance the linker (or create a post 
link utility) to determine when multiple local type descrip- 
tions from different object modules are describing the same 

"'^ type and create a single global version of the type descrip- 
tion (Global refers to the fact that it is accessible beyond the 
scope of a single object module's debug information). The 
link utility eliminates the duplicate local type descriptions 
and remaps all references to the global version of the type 

20 description. This approach solves the executable module 
size problem, but the object module size problem remains. 
Also, the compile time problem may actually be exacerbated 
by the link time cost of packing the debug type information. 
The second approach is to enhance the compiler to emit 

25 full type descriptions only in the "distinguished compile 
unit" for that type. A heuristic commonly used to select a 
distinguished compile unit for a class is described in The 
Annotated C++ Reference Manual by Ellis & Stroustmp, 
1990. The compile unit that contains the implementation of 

30 the lexically first non-inline virtual function member in that 
class is used as the distinguished compile unit. In other 
compile units that must reference the type that is fully 
described in the distinguished compile unit, a degenerate 
description of the type is emitted. The degenerate reference 

35 is a debut type record that does not describe the type but does 
provide a unique identifier for the type. By relying on the 
single definition rule in C++ the debugger and/or the linker 
is able to replace references to the incomplete type with 
references to the full type description. 

While this technique solves the compiler time and disk 
space problems, it is unable to handle a very common class 
of applications, those that use binary class libraries or 
classes implemented in code (dll files) that are dynamically 
linked without debugging information. Since the prior art 

45 method depends on emitting the full type information only 
in the distinguished compile unit for the type, it cannot 
produce a debuggable application when the source code of 
the distinguished compile unit is not part of the user's build 
process. 

50 Class libraries currently shipped in binary do not usually 
include debugging information with them because their 
producers assume that the header files shipped with the 
libraries provide enough information to build debugging 
information using a standard compiler and debugger. 

55 Furthermore, full debug information for the library source 
that could be used in compiling debugging information for 
an application would not be limited to the information 
required to describe types. The source information would 
expose other information about the implementation of the 

60 class library that producers could be unwilling to make 
generally available to customers in the absence of source 
code licenses. 

SUMMARY OF THE INVENTION 

65 It is therefore an object of the present invention to provide 
a technique to improve compilation in avoiding the compi- 
lation of duplicate debugging information. 
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It is also an object of the present invention to provide a According to the preferred embodiment of the present 

mechanism for emitting minimal debug type information in invention, an additional mode for compiling objects con- 

a distinguished compile unit while still supporting the taining debugging information generates debugging infor- 

dcbugging of classes for which the distinguished compile mation for all types whether they are referenced or not and 

unit is not, in this module, being built and debugged. 5 regardless of whether the unit being compiled is the distin- 

Accordingly. in one aspect, the present invention provides ^^^^^^ ^'"P"^ ^^^^ *yP^- 

a method for emitting unique debugging type information ^^^^^^ *his, the builder of a binary class library can: 

providing full debugging support for types implemented in 0 create an object file containing all debug information 

a binary library class. During compilation, a directive to a full range of types implemented and exported in 

linker is emitted to add a library containing an object file lo the class library; and 

with the debugging type information. A reference to an "> "™°^^y ^^^^ ^^^^ '^^^^n*^^ interface to 

external symbol defined in the object file is also emitted. "^^"^ class library so that the debugging infor- 

Then. for each type, a full description of the type is emitted "^^^^^ .^^^^^ f ^^^y^ available whenever 

in the distinguished compile unit for that type, while degen- implemented in the class hbrary are referenced in 

, , c ,u 4 -4. J- \u 1 ^c the user's source code during compilation, 

era e descriptions for the type are emitted in other compUe 15 .^^^ ^ concurrently-filed U.S. patent 

uniK Dunng linking, a debugging index packing tool is application titled A BINARY CLASS LIBRARY WITH 

enabled. The reference to the external symbol is resolved by DEBUGGING SUPPORT Sen No. 08/992.876. A preferred 

Unking the object file. Ihe degenerate descriptions are ^lethod for achieving this is illustrated in the flow diagram 

resolved to fuU descriptions of the type obtained from the of i^q j ^n^i discussed below. 

The invention also provides a compiler adapted to com- describe the interface to users of the class library, are 

pile object oriented program debugging type information, modified by adding a source text to cause the compiler to 

including debugging type information for types imple- direct a linker toward a library with the object module 

mented in a binary class library. This includes means to emit containing full debugging information for the class library 

a full description of a type in a distinguished compile unit for and to create a reference to an external symbol defined in the 

the type and otherwise to emit degenerate descriptions of the object module. This source could be: 

type referenced in the program, means to direct a linker to [) a pragma that will direct the Unker to add the static link 

add to its hnk Hbranes a library containing fuU debugging library containing the object module with the full debug 

type information for the binary class library, and means to type information for the types implemented in the class 

du-ect the linker to resolve the degenerate descriptions by library into the link whenever the header file is included 

accessing the hbrary containing the fuU debugging type in the user^s source; and 

information for the binary class. jj) ^ reference to an externally defined symbol (to be 

In a further aspect, the mvention provides a computer defined in the object containing the full debug 

program product that consists of a computer usable medium information) that will force the linker to add the object 

having new, useful and non-obvious combination of com- file containing the full debug information, 

puter readable program code means embodied thereon to For example, in a C++ class library of the preferred 

program a general purpose computer to perform the method embodiment, inserting the following code fragmem into 

steps set out above. each header file wiU accomplish the desired result: 

BRIEF DESCRIPTION OF TIIE DRAWINGS 40 defined (_MIN_DEBUG_) 

/pragma library(dbuginfo.lib) 

Embodiments of the invention will now be described in ^ reference(ClassLibraryTypelnfo) 

detail in association with the accompanying drawings, in jtendif 
which' 

. ^ In this code fragment, _MIN_DEBUG_ is a macro that 

HG. 1 IS a flow diagram illustrating steps that could be ^5 ^ reserved by the compiler and defined when the minimized 

taken by a class library builder or by a user of the class debugging information mode is activated, 

hbrary for creating an object module containing full debug- "#pragma library" line causes the compiler to embed 

ging information for all types in a hbrary; ^ control directive in the object module which in turn directs 

FIG. 2 is a flow diagram illustrating the steps taken in the the linker to add a library (dbuginfo.lib, in this case) to the 

compiler to generate minimal debugging information to the 50 list of libraries from which it tries to resolve symbolic 

linker, according to the preferred embodiment of the inven- references. The added library would contain the object 

tion; and module where the full debug information can be found, 

FIG. 3 is a flow diagram illustrating the steps taken in the The pragma reference direcLs the compiler to put a ref- 

linker to generate a single set of comprehensive debugging erence to the named symbol (ClassLibraryTypelnfo, in this 

information from the minimal debugging information 55 case) into the object module so that the liiier wifl be forced 

records provided by the compiler using debugging informa- to include the debug object which defines that symbol, in 

tion provided with a class library, according to the preferred order to satisfy the reference. 

embodiment of the invention. Second, a body file is created to #include the header files 

DETAILED DESCRIPTION OF TIIE nt'Tu'''^ r'^'l^f j^^^^^ 'y^^^^] 
PRPFFRRFH FMROniMPisnx; ^° ^^^"^^ ClassUbraryTypelnfo in the code fragment set out 
PREFERRED EMBODIMENTS ^^o^^) referenced in the header files (block 12). The body 
As discussed above, conventional compflers either pro- file is compiled using the option in the compiler for pro- 
vide full debugging information for referenced classes, or ducing full debugging information, the option that creates 
debugging information is emitted for referenced classes only type descriptions for types even if they are not referenced 
if compiling the "distinguished compile unit". If the header 65 (block 14). 

files contain material that is never referred to, no debugging 'llie resulting object module is put into a static library 

information is generated for it. named to agree with the name given in the #pragma libraryQ 



03/29/2004, EAST Version: 1.4.1 



us 6,226,786 Bl 



20 



25 



Statement of the co6q fragment (block 16). This can be an 
existing link library or a separate link library, at the discre- 
tion of the provider of the binary class library. 

Once the full object debug file is accessible, the minimum 
debug feature of the present invention can be implemented. 5 

According to the method of the invention, when the 
compiler is compiling the header files that describe the 
classes in the class library in the minimum debug mode, it 
will compile the #pragma libraryQ and pragma reference 
statements. The compiler will then put directives in the ao 
object module that tell the linker to find dbuginfo.lib and to 
resolve a reference to the symbol ClassLibraryTypelofo. 
When the linker processes the object so produced, it will be 
forced to add the object module containing the full debug 
information to the link. is 

This is illustrated in the flow diagram of FIG. 2. 

The compiler is enhanced to operate in the minimum 
debugging mode on detecting the command line switch 
(block 20). In the absence of that, it will compile debugging 
code in the conventional manner discussed above, emitting 
type information in each compile unit that refers to that type 
(block 22). The link will also be according to convention 
(block 24); since there are no references to the debugging 
information library for the class library nor to any symbol 
that is defined in that library, the link does not result in errors 
and all debugging information comes from the objects 
themselves. 

When the switch for implementing minimum debugging 
information is detected (block 20), the compiler first defines 
the preprocessor macro _MIN_DEBUG_ (block 26). 
Because this macro symbol is defined, the code fragment 
added to each header file by the class library builder causes 
a reference to "ClassLibraryTypelofo" to be added to each 
module created by the compiler that includes one of these 
header files, and a directive is passed to the linker, via the 35 
object module, to make it add the "dbuginfo.lib" library to 
its list of default libraries in the link. The references to 
"ClassLibraryTypelnfo" are satisfied by the linker when it 
brings in the object module containing the complete debug 
information descriptions for the class library (built by the 40 
class library builder as described in FIG. 1) which also 
contains the definition of the symbol "ClassLibrary- 
TVpelnfo". 

For teach reference to a class in the program, the compiler 
determines whether the module it is compiling is the dis- 
tinguished compile unit for the class (block 28, FIG. 2). If 
it is not, the compiler generates only degenerate debug 
information type records (degenerate references) to describe 
the class (block 30). If the class is the distinguished compile 
unit, the compiler creates complete debug information 
records to describe the class and all its members (block 32). 

ITiis process continues (blocks 34, 36) until the compi- 
lation is complete and the records are emitted to the linker 
(block 38). The debug packing feature in the linker matches 
up the degenerate references to types in the compile units 55 
with the full descriptions of those types found in the full 
debug object module, following the steps illustrated in FIG. 
3. 

Once the debug type index packing tool has been enabled 
to link time, the linker examines each type record in the 60 
debug type information to determine whether the record is 
degenerate (block 40). 

Where the debug information type record is not a degen- 
erate record, it must be matched to a global type (block 42). 
To match the record to a global type, cither a match of the 65 
parent type (i.e., class) and all sibling records or a match of 
the parent to a global degenerate record must be located 



(block 44). If the match is successful, the type references are 
remapped in local symbols to point to the global version of 
the type description (block 46) and the local type descrip- 
tions are discarded (block 48). 

If the match is not successful, a new type must be added 
to the global variable table (block 50) so that the local 
references can be remapped from the local type to use the 
global type instead (block 46, 48). 

Where the debug information type record is a degenerate 
reference, a match to a global type must be located, either to 
match to a global degenerate type record of the same name 
and type (block 52) or to a fully described type with the same 
type and name (block 54). If the match is successful, then the 
type reference in the local symbol is remapped to point to the 
global version of the type description (block 46) and the 
local symbol is discarded (block 48). If not, then a new type 
must be added to the global type table (block 50) so that the 
local symbol can be remapped to a global version and the 
local type description discarded (blocks 46, 48). 

The resulting execu tables and objects containing debug- 
ging information are of minimum size, and the compilation 
cost of producing the debugging information is drastically 
reduced. 

In order to demonstrate these advantages of the use of the 
present invention, a simple test was conducted. A directory 
containing six pre-compiled object modules with a number 
of header files from a visual building tool was built in an 
IBM OS/2 system (50 MHz 486 24 MB). 

The first test build was done without any debugging 
information being compiled currently, to establish a calibra- 
tion. 



45 



50 



Test 1: 


Start 


17:33:23 


End 


17:35:21 


Duration 


1 :58 seconds 


5:33 pm 


18061 hytcs A.obj 


5:33 pm 


6418 B.obj 


5:34 pm 


4963 C.obj 


5:34 pm 


38569 D.obj 


5:35 pm 


9739 E.obj 


5:35 pm 


1953 F.obj 


6 flics 79703 bytes used 




l"he second test build 


1 was performed with the conven- 


tional debugging information option activated. 


Ibsl 2: 


Start 


17:35:22 


End 


17:37:48 


Duration 


2:26 seconds 


5:35 pm 


118750 bytes A.Qbj 


5:36 pm 


72639 B.obj 


5:36 pm 


41719 C.obj 


5:36 pm 


219691 D.obj 


5:37 pm 


109410 E.obj 


5:37 pm 


26131 F.obj 



6 files 588340 bytes used 

In the third build, the minimum debugging option of the 
present invention was activated. 
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Test 3: 


Start 


17:37:49 


EDd 


17:39:51 


Duration 


2:02 seconds 


5:38 pm 


49238 bytes A.obj 


5:38 pm 


25241 B.obj 


5:38 pm 


18201 Cobj 


5:39 pm 


102882 D.obj 


5:39 pm 


37052 E.obj 


5:39 pm 


8605 Robj 



6 files 241219 bytes used 



From Ihe foregoing, it can be seen that the second build, 
using the conventional debug information compilation tech- 
nique added 25% to build lime and 640% to object size, 
while the minimum debug collection option of the present 
invention reduced the time increase to 4% and the object size 
increase to 200%. 

As an alternative to the above described preferred 
embodiment, the library could be eliminated for any cases 
where the class hljrary itself is only in one library file. 
However, this would not be as effective for an operating 
system, such as IBM's OS/2, in which support for both static 
and dynamic linking is required and where there would be 
a library for static linking and an import library for dynamic 
linking. Two copies of the large fiill debugging information 
would also be required. 

Also, the reference could be implemented simply as: 

extern int ClassLibraryTypelnfo 

This could not be used, however, in a compiler that has 
been enhanced to remove false references to avoid drawing 
unnecessary objects into the link without further enhance- 
ment. For example, if the back end of the compiler has been 
modified to permit removal of references only where the 
code or data artifacts that make the references are also 
removed, then references having no code or data artifacts 
referring to them in the first place will not be removed. The 
reference cited above would have no code or data artifacts, 
and it would not be removed. 

Further modifications to the invention that would be 
obvious to those skilled in the art are intended to be covered 
within the scope of the appended claims. 

What is claimed is: 

1. A method for providing debugging information for a 
plurality of types in an object-oriented computer program- 
ming system, the method comprising the steps of: 

providing a class library for the at least one type of the 
plurality of types, the class library capable of being 
dynamically linked without full debugging informa- 
tion; 

providing a control directive to a linker to add a library 
containing an object module with the full debugging 
type information for the at least one type; 

providing a reference to an external symbol defined in the 
object module for the at least one type; 

providing a full description of the at least one type in a 
distinguished compile unit for the at least one type; and 

providing a degenerate description for the at least one 
type. 

2. The method of claim 1 wherein each of the at least one 
type has a unique identifier and the degenerate description 
further includes a unique identifier of that type. 

3. 'llie method of claim 2 wherein each of the at least one 
type is a global type and the unique identifier includes a fully 
qualified type name for the global type. 



J6,786 Bl 

8 

4. The method of claim 2 further comprising the step of: 
linking the object module, the linking step further includ- 
ing the steps of: 

enabling a debugging type index packing tool; 
^ resolving the reference to the external symbol by 
linking the object module; and 

resolving the degenerate description to a full descrip- 
tion for the at least one type obtained from the 

5. The method of claim 2 wherein the at least one type 
further includes a type implemented in a binary class library. 

6. A computer-readable medium containing a program for 
providing debugging information for a plurality of types, the 

15 program including instructions for: 

providing a control directive to a linker to add a library 
containing an object module with full debugging type 
information for the at least one type, the at least one 
type of the plurahty of types being in a class library, the 
2^ class library capable of being dynamically linked with- 
out full debugging information; 
providing a reference to an external symbol defined in the 
object module; 

25 providing a full description of the at least one type in a 
distinguished compile unit for the at least one type; and 
emitting a degenerate description for the at least one type. 

7. The computer-readable medium of claim 6 wherein the 
program instructions further include instructions for: 

enabling a debugging type index packing tool; 
resolving the reference to the external symbol by linking 

the object module; and 
resolving the degenerate descriptions to full descriptions 
35 for the type obtained from the library. 

8. The computer-readable medium of claim 7 wherein the 
at least one type further includes a type implemented in a 
binary class library. 

9. A compiler adapted to compile object oriented pro- 
grams and include debugging information for a plurality of 
types being compiled, the compiler comprising: 

means for providing a full description of the at least one 
type in a distinguished compile unit for the at least one 

45 type and otherwise emitting a degenerate description 
for the at least one type, the at least one type of the 
plurality of types being in a class library, the class 
library capable of being dynamically linked without 
full debugging information; 

50 meaas to direct a linker to add to a plurality of link 
libraries a library containing an object module includ- 
ing full debugging type information for the at least one 
type; and 

means to direct the linker to resolve the degenerate 
description by accessing the library containing the full 
debugging type information for the at least one type. 

10. The compiler of claim 9 wherein the means to direct 
the linker further include: 

(JO means to embed a control directive to the linker to add a 
library identified in a header file in a class library. 

11. The compiler of claim 9 wherein the means to direct 
the linker to resolve the degenerate description further 
include: 

65 means for providing a reference to an external symbol 
defined in the library containing full debugging type 
information. 
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12. The compiler of claim 9 wherein the at least one type 
further includes a type implemented in a binary class library. 

13. The method of claim 1 wherein the full description, 
the degenerate reference or both provide the full debugging 
type information for the class library through the object 5 
module, the control directive and the reference to the exter- 
nal symbol. 

14. The computer-readable medium of claim 6 wherein 
the full description, the degenerate reference or both provide 
the full debugging type information for the class library 



through the object module, the control directive and the 
reference to the external symbol. 

15. The system of claim 9 wherein the full description, the 
degenerate reference or both provide the full debugging type 
information for the class library through the object module, 
the control directive and the reference to the external sym- 
bol. 
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